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Abstract 

We consider the random walk attachment graph introduced by Saramaki and Kaski and 
proposed as a mechanism to explain how behaviour similar to preferential attachment may 
appear requiring only local knowledge. We show that if the length of the random walk is fixed 
then the resulting graphs can have properties significantly different from those of preferential 
attachment graphs, and in particular that in the case where the random walks are of length 
1 and each new vertex attaches to a single existing vertex the proportion of vertices which 
have degree 1 tends to 1, in contrast to preferential attachment models. 
AMS 2010 Subject Classification: Primary 05C82. 
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1 Introduction 



There is currently great interest in the preferential attachment model of network growth, usually 
called the Barabasi- Albert [p], [jj model, though it dates back at least to Yule p^| , and was 
discussed also by Simon (llj. In the simplest version of this an existing graph (network) is 
incremented at each stage by adding a single new vertex which then attaches to a single pre- 
existing vertex, and this latter is chosen, from amongst those of the pre-existing graph, with 
probability proportional to the degree of that vertex. In the Barabasi-Albert model the new 
vertex will connect to some fixed m vertices but here we only consider the case m = 1. One of 
the best known properties of the model is that it produces a power law degree distribution, as 
shown rigorously by Bollobas et al ||. A generalised model of this approach (the "M-model") 
was presented by Chrysafis and Cannings wm. 
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One weakness of this model and its generalisations is that this implicitly requires a calculation 
across all the existing vertices, or at least a knowledge of the total degree (sum of the vertex 
degrees) of the graph. This requirement then destroys the potential for this model to have 
emergent properties from local behaviour. 



A possible solution to this was proposed by Saramaki and Kaski [10|. In their model the new 
vertex simple chooses a single vertex from the graph and then executes a random walk of length 
£ step initiated from that vertex. Saramaki and Kaski [^] and Evans and Saramaki claim 
that this reproduces the Barabasi- Albert degree distribution, even when £ = 1. It is clear that 
this is the case if the random walk is run for long enough to have converged to its stationary 
distribution. However, as we shall prove in the particular case 1 = 1 the degree sequence does 
not converge to a power law distribution but rather to a degenerate limiting distribution in 
which almost every vertex has degree 1. 



2 The Model 



Let Go be an arbitrary (perhaps connected) graph. Form G n +i from G n by adding a single 
vertex. This vertex chooses a single vertex (i.e. m = 1) to connect to by picking a vertex 
uniformly at random in G n and then, conditional on the vertex chosen, performing a random 
walk of length £ on G n , starting from a randomly chosen vertex, and then choosing to connect 
to the destination vertex. 

(n) 

We first consider the number of leaves in the graph. Let p d be the proportion of vertices in 

(n) 

G n with degree d, and let L n = p\ , i.e. the proportion of leaves. The number of edges in G n 
will be n + eo, the total degree will thus be 2(n + eo), and the number of vertices will be n + vq, 
where eo and vq have obvious meanings. Let V n be the vertex initially chosen at random at step 
n, and let W n be the vertex selected by the random walk, so the new vertex connects to W n . 
We now prove the main result. 

Theorem 1. When I = 1, as n — > oo, L n — >■ 1, almost surely. 



Proof. We assume that Go is not a star. If Go is a star, then it is clear that, with probability 
1, G n will eventually not be a star, so we can just wait until this happens and re-label the first 
non-star graph as Go- Then, for all n, d, a vertex of degree d in G n has at least one neighbour 
which is not a leaf, and in particular no leaves have a leaf as their neighbour. If V n is a leaf, 
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which has probability L n , then W n will be one of its neighbours, which will not be a leaf, so in 
this case the number of leaves increases by 1. Hence 

E((n + l)L n+ i|G n ) > nL n + L n = (n + l)L n , (1) 

and so E(L n+ i\G n ) > L n and so (L n ) n( =N is a submartingale taking values in [0,1], and thus 
converges almost surely to a limit, which we call L^. 



To show that = 1, note that conditional on V n having degree d then the probability of W n 
not being a leaf is at least 1/d, so we can make (|l]) sharper, getting 

oo (n) 

£(W|G„)>L n + ]T-^— . (2) 



The total degree of non-leaves in G n is (2 — L n )n + riQ, and the number of non-leaves is (1 — L n )n, 
so the average degree of non-leaves is jEjf' + • Hence if ^ 1, for n large enough and for 

any A > at least half the non- leaves have degree at most 2y5^- So, there exists N such that 
ifn>iV, 

E(L n+1 \G n ) >L n+ 2 2(n + 1)(2 _ A) - (3) 



Hence 

\2 



E(L„|G„) > EiL N ) + ^ — , (4) 

v ; fc=JV 

which implies that £^(L n |G n ) > 1 for large enough n, a contradiction. Hence = 1. □ 



3 Go Bipartite 



We now consider a special case, which demonstrates that for all odd £, the random walk model 
of (T(| differs fundamentally from that of the Barabasi- Albert model. 

Assume that Go is a bipartite graph, with the two parts coloured as red and blue. Then, in both 
models, for all n G n will be bipartite, and the parts can be coloured red and blue consistently 
for each n. Let the proportion of red vertices in G n be R n . We begin with the random walk 
model, 
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Theorem 2. We have Roq such that R n converges almost surely to Roo- If £ is even, then 
Roo = \, almost surely, while if £ is odd R^ is a random variable with a Beta distribution. 

Proof. Conditional on G n , V n will be red with probability R n . If £ is odd W n will be of opposite 
colour to V n , which implies that the new vertex (which connects to W n ) will be of the same colour 
as V n , and thus, conditional on G n , will be red with probability R n and blue with probability 
1 — R n . Hence in this case the colours of vertices are equivalent to the colours of the balls in a 
standard Polya urn (where when a ball is drawn two of the same colour are returned), and so 
R n converges almost surely to R^ where R^ has a Beta distribution whose parameters depend 
on Go- 

If £ is even then W n is of the same colour as V n and so the new vertex is of opposite colour to 
V n . Hence this case corresponds to a two-colour generalised Polya urn where a ball is selected 
and a ball of the opposite colour is added, namely a Friedman urn with a = and (3 = 1. In 
this case R^ — > \ almost surely, see for example Freedman ||. □ 

Theorem 3. In the Barabdsi- Albert model R^ = \ almost surely. 

Proof. In this model it is possible to associate the selection of a vertex with an urn model by 
considering half-edges, and giving each half-edge the colour of its associated vertex, i.e. each edge 
is split into a red half and a blue half. The selection of a vertex with probability proportional 
to its degree is then equivalent to selecting a half-edge uniformly at random and then selecting 
the associated vertex. As the new edge added in will always consist of a blue half and a 

red half, the proportion of red half-edges must converge to ^, and as a red vertex is added if 
and only if a blue vertex is selected, the proportion of red vertices will converge to i, almost 
surely. □ 

So in this respect the behaviour of the random walk model is different from the Barabasi- Albert 
model when £ is odd, regardless of the size of i. 



4 Discussion 

We have demonstrated that the model of Saramaki and Kaski is fundamentally different from 
that of Barabasi and Albert, unless we allow an indefinite length for the random walk component. 
It does have the advantage of not requiring a global calculation, retaining the local behaviour 
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characteristic which is desirable in models of emergent behaviour. An alternate approach might 
be to imagine that the addition of edges is affected by the vertices in G n , rather than by the 
new vertex. Thus each vertex in G n could link to a new vertex as it arises with probability 
proportional to its degree, independently all other vertices, as in the variant of preferential 
attachment studied by Dereich and Morters H |6|. This, of course, destroys one of the usual 
assumptions of the preferential attachment model that the number of new links is some fixed 
value m, though we could substitute the condition that the average number added was fixed. 

The urn model approach is interesting particularly since there is much known about these (see 
for example the survey paper by Pemantle |]]). We might generalise the model to consider 
directed graphs where there are k colours q; i = 0, k — 1, with directed edges only between 
a vertex of colour Cj and one of colour C(j +1 )( m0( ^ ja. When a new vertex is added it links at 
random to a vertex and then takes I random steps along directed edges, its colour then being 
determined. The case I ^ 0(mod k) will have the proportions of each colour converging to 1/k, 
whereas for t = 0(mod k) there will be a Dirichlet distribution with parameters depending on 

Gq. 
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